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The definition of the fundamental quantity, the chemical potential, is badly confused in the 
literature: there are at least three distinct definitions in various books and papers. While they 
all give the same result in the thermodynamic limit, major differences between them can occur 
for finite systems, in anomalous cases even for finite systems as large as a cm^. We resolve 
the situation by arguing that the chemical potential defined as the symbol /i conventionally 
appearing in the grand canonical density operator is the uniquely correct definition valid for all 
finite systems, the grand canonical ensemble being the only one of the various ensembles usually 
discussed (microcanonical, canonical, Gibbs, grand canonical) that is appropriate for statistical 
thermodynamics, whenever the chemical potential is physically relevant. The zero-temperature 
limit of this /i was derived by Perdew et al. for finite systems involving electrons, generally 
allowing for electron-electron interactions; we extend this derivation and, for semiconductors, we 
also consider the zero-T limit taken after the thermodynamic limit. The enormous finite size 
corrections (in macroscopic samples, e.g. 1 cm^) for one rather common definition of the c.p., 
found recently by Shegelski within the standard effective mass model of an ideal intrinsic semi- 
conductor, are discussed. Also, two very-small-system examples are given, including a quantum dot. 

Key words: statistical thermodynamics, insulators, thermodynamic limit, quantum dots. 
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I. INTRODUCTION 

This paper deals with the conceptual problem of defin- 
ing the fundamental quantity, the chemical potential, in 
statistical mechanics. Understanding of this matter is im- 
portant in many places in physics and chemistry. That 
the account in the present literature is badly confused 
will become clear just below. To gain this understand- 
ing, I have found that the relation between equilibrium 
statistical mechanics and thermodynamics has to be re- 
considered. 

Two distinct issues are discussed here. One is the def- 
inition of the chemical potential, which I will call c.p. 
until the question is resolved. The other is the question 
of the accuracy of the thermodynamic limit for a piece 
of a semiconductor as large as a cm'^. We begin with a 
specific example, an ideal semiconductor, that serves to 
introduce both issues; later, other quite different, models 
and related experiments are used to illustrate the issue 
of definition in connection with very small systems. 

Recently, Shcgclski^-^ defined the quantity 



(j){N) =F{N + 1)-F{N), 



(1) 



as being the c.p. of an iV-electron system. Here F is the 
Helmholtz free energy, as calculated in the canonical en- 
semble, and the difference is taken at constant tempera- 
ture T and volume V.^ Within the usual non-interacting- 
electron model (NEM) of an intrinsic semiconductor with 
N electrons^i^, he found the following: The behavior of 
(l){N) as a function of T differs appreciably, at moder- 
ately low T, from the usual result"*'^ for the chemical 
potential, even for systems with volumes the size 
of the universe!, although in the thermodynamic limit 



{V,N oo,N/V fixed) (j){N) the usual result for all 
T > 0. He finds (j){N) = ec at T = for any finite volume 
V, where Ec is the energy at the bottom of the conduc- 
tion band. In contrast, the thermodynamic limit (TL) 
of (f>{N) (ec + ey)/2 as T — > 0, where e„ is the energy 
at the top of the valence band; i.e. the thermodynamic 
limit of 0(iV) approaches the middle of the band gap, as 
does the usual result. Shegelski's result behaves qualita- 
tively exactly as one would expect without calculation: 
At T = 0,F{N) = E{N), the ground state energy for 
N particles, so that for the model considered (j){N) — 
as easily shown from the expression for E{N). And at 
T > (seei and Appendix B in^)^, in the TL 



0(iV) 



d{F/V) _ dF 
d{N/V) ~ dN' 



(2) 



the derivative taken at constant T. {F/V is the limiting 
function^.) Thus this is the usual result, approaching 
the middle of the gap as T 0. The equality is formal, 
presented to exhibit the last form as the usual definition 
of the c.p. in thermodynamics^, where TV is assumed to 
be continuous. What is surprising and most interesting 
are the enormous finite-size corrections. Such a scenario 
where the TL cannot be essentially reached with a cm'^ 
sample is certainly discomforting, but in light of Shegel- 
ski's results, clearly must be considered, as done below. 

These results force extra care with terminology. I will 
use the term 'macroscopic' to mean systems with roughly 
Avogadro's number of particles, to within a few factors of 
10 either way, to be contrasted with the TL. I will mean 
by 'finite systems' those with the number of particles 
ranging from a few (say 1,2, ...100) up to macroscopic 
size. I will also use the term 'small systems' to denote 
those with a few particles. 
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First I note that the identification of (f) with the c.p. 
is found elsewhere, e.g in the book by Ashcroft and Mer- 
min."'' Landsberg and Brownei^i also apparently accept 4> 
as the c.p.: in reference to Shegelski-'- they state "it is con- 
firmed that the Fermi level of an intrinsic semiconductor 
goes to the bottom of the conduction band as T 0." 
But they note that "however the loss of one electron is 
sufficient to make it go to the top of the valence band, 
and for some purposes it may be adequate to take an av- 
erage of the two values" . Evidently they are identifying 
the c.p. with the Fermi level, and they seem to be saying 
that the c.p. is not a uniquely defined quantity. 

Another definition is given in the book by Kittel and 
Kroemenii, namely the c.p. for an A^-particle system is 
identified with 

i,(N)~F{N)~F{N (3) 

this being the top of the valence band at T in 
the NEM. Fetter and Walecka^^ gj^g bo^j^ ^ Q 
at T = (they confine themselves to the TL, but as 
we have seen, these differ for our model semiconductor, 
even in the TL). HiHA^,, considering general T and small 
systems, gives ^ but notes that would be just as 
good a choice. Chaikin and Lubenskjii^ define the chem- 
ical potential, in the context of thermodynamics, as the 
change in internal energy produced by addition of one 
particle, apparently supporting Baierleini^ gives c.p. 

Perdew et aU^J^ showed for systems with Nq electrons 
that the zero-T limit of the quantity v that appears in 
the grand canonical density operator, 

p = exp[~f3{H - vN)]ltr exp[~(3{H ~ vN)], (4) 

is 

u{No) = ^[E{No + l)-EiNo-l)], (5) 

where (3 = l/{kT), and A'o is the average value of the 
number of particles, taken to be integrali^. See alsoi^. 
Since v is also quite generally called the chemical poten- 
tial, we clearly have another definition of the c.p. It is 
the average of and ip at T = 0, mentioned ini£. 

For metals, the various formulas give essentially the 
same result for N ~ 10^'^. It is for small systems, (e.g. 
a Na atom or a quantum dot) and for semiconductors 
that one needs to be concerned about the difference be- 
tween them. It is well known (see e.g. refi^) that in 
the TL, within the NEM, the average occupation num- 
ber of a 1-electron state with energy e is the Fermi-Dirac 
distribution 

<n,>=f{e-iy) = [exp(3{e-u) + l]-\ (6) 

In the well known and very simple derivation of 10) via 
the grand canonical ensemble, the quantity v is the same 
as that appearing in Q), © holding independently of sys- 
tem size. It is also well known that, as we have said, in 



the thermodynamic limit taken at T > 0, within the 
NEM, ly, (the quantity usually actually calculated as the 
c.p. in the NEM), approaches ©, which is the middle 
of the gap for the semiconductor model, as T — > (see, 
e.g.-M). But we need to be concerned about the role of 
finite size, as we said. 

Note that ((SJ can be written 

iyiNo) = -^[I{No)+AiNo)], (7) 

where 

/(iVo) = E{No ~ 1) ~ E{No) 

A{No) = EiNo) - E{No + 1) (8) 

are respectively the ionization energy and electron affin- 
ity of the A'o-electron system at T = 0. Parr et al^-'^ note 
that this is closely related to a formula due to MuUikenSS, 
who spoke of the electronegativity (e.n.) rather than the 
chemical potential, and gave a formula for e.n. as the 
negative of 0. See also Gyftopoulos and HaftopoulosSi 
who simply define e.n. as -v (arguing that "since v is 
interpreted as the escaping tendency (the opposite of the 
power to attract) of a component from a thermodynamic 
system, it is reasonable to use its negative as a measure 
of electronegativity"), and^SiiLSi for related discussion. 
It appears that the analysis inp2S is approximate, involv- 
ing tight-binding molecular orbitals and Hartree-Fock ap- 
proximation; whereas jSJ) was proved more generally (also 
see below). 

Thus we have several distinct definitions for the chem- 
ical potential appearing in the literature. For mathemat- 
ics only, one can take his/her pick. However, there is an 
important piece of physics that removes this arbitrari- 
ness, in my opinion. Namely, the condition for equilib- 
rium with respect to particle flow between two systems 
at the same temperature is that their chemical potentials 
be equal, the chemical potentials in this context being 
defined as the values of v appearing in their respective 
distributions (see e.g. TolmanS^). Thus, thinking in 
terms of experiment, equilibrium, i.e. zero current, will 
occur e.g. between a metal and a semiconductor in con- 
tact and at the same T when i/ for the semiconductor 
equals that in the metal. In standard thermodynamics, 
it is the relation (O, c.p. — dF/dN, plus minimization 
of the total free energy, that leads to this fundamental 
equilibrium conditioni^ 

In Section II, I argue that the grand canonical ensemble 
is the proper basis for a theory of statistical thermody- 
namics, and correspondingly, it is v that is uniquely the 
appropriate meaning of the c.p. The argument is sup- 
ported by two very-small-system examples, a quantum 
dot and a pair of atoms contemplating charge transfer. 

In Section III, I discuss the zero-T limit of ly, both for 
finite systems, and, in the case of solids, whether the limit 
is taken before or after the TL. Section IV describes a 
simple calculation of v for a finite semiconductor within 
the same NEM discussed above, with attention to the 



3 



approach to the TL. It is shown that this quantity does 
not show the strange huge finite-size effects found for 
0, V being very close to the TL for macroscopic size, 
at all T. In the Appendix I give a somewhat simpler 
calculation than Shegelski's which supports his finding of 
a huge finite-size effect (this shows up for v(Nq + 1), iVo 
corresponding to the neutral semiconductor). 

As in the relevant references citedi'Siii£iili~iiSiiSiiij we 
will be considering the very common one-component 
model where the electrons move in a fixed one-electron 
potential. However, the considerations of Sections II and 
III, with the exception in the latter of the quantum dot 
model, are more general: They allow for the adjustment 
of the one-electron potential as one adds or removes elec- 
trons. E.g., in the hydrogen molecule, the equilibrium 
separation of the protons will differ for the neutral and 
for the charged cases, such adjustments being allowed for 
in the formalism; a given molecule in the usual chemical 
sense has a given number of nuclei with different degrees 
of ionization. Alternately, when we consider the thermo- 
dynamic limit of a solid, we add nuclei as we add electrons 
in the customary way: in calculating the TL of F{N) for 
the semiconductor, the net charge is kept constant (at 0); 
similarly, for F{N + 1), the net charge is again held con- 
stant (at 1 electronic charge). Also, the generalization to 
multi-component systems of the final choice of definition 
of the c.p. is mentioned in the next section. 

I mention here that in Section IV (Final Remarks), I 
emphasize the fact that the ideal intrinsic semiconductor 
is quite special (anomalous), and although it is clearly 
of considerable pedagogical importance, the very-small- 
system examples are of more physical interest. 



II. RECONSIDERATION OF STATISTICAL 
THERMODYNAMICS 

The usual definition of F is via the canonical ensemble: 

F{N) = -kTlnZciN) (9) 
Z,{N) = Y,^M-PEj{N)], (10) 



where j = 1, 2, • ■ • and Ei{N) < E2{N) < ■ ■ ■ are the 
energy eigenvalues for states of N particles. We have 
assumed that the Hamiltonian and the operator N for 
the number of particles commute. N is of course dis- 
crete, so that differentiation with respect to N is not 
defined. If an analytic expression for F{N) were known, 
then, for large N, continuing the function to continu- 
ous N would make sense; e.g. if F{N) = N + InN, 
then (f){N) ^1 + ln{l + l/N), which approaches dF/dN 
for large N. But generally such knowledge is not avail- 
able; while interpolating in some smooth way between 
the integer values might be satisfactory, this entails some 
loss of rigor, and might be troublesome, particularly for 
small systems. Surely this has been the motivation be- 
hind defining the chemical potential in terms of finite 



differences of F{N) as in |^ and A similar situ- 

ation exists for the other commonly discussed distribu- 
tions, microcanonical, or the Gibbs distribution (fixed T 
and pressure) , with the exception of the grand canonical 
ensemble. In the latter case, N is merely a summation 
variable, the role of the number of particles is played by 
the average value of N, 



N = 



(11) 



which is a continuous variable at T > 0. 

In standard statistical thermodynamics, each of the 
various distributions gives rise naturally to a correspond- 
ing thermodynamic potential or free energy. E.g., the 
canonical ensemble gives directly the Helmholtz free en- 
ergy: defining the entropy as 5 = —ktrpclnpc where 
Pc —exp{—j3H)/Z(., and the trace is taken over states 
with fixed number of particles iV, one sees easily that 
F = U — TS {U = average energy). And F comes out 
directly as a function of T, V, N . In thermodynamic the- 
ory^, knowledge of F{T, V, N) as a function of the vari- 
ables indicated, the natural variables^ for F, is a fun- 
damental equation; that is, from it one can calculate all 
thermodynamic properties, including other free energies 
and the chemical potential. Similarly, the grand canoni- 
cal ensemble gives the grand canonical free energy, called 
the thermodynamic potential by Landau and Lifschitz^— , 



n = -fcrin z. 

Z = tr exp[-l3{H - i^N)] 



(12) 
(13) 



is the grand canonical partition function, which is given 
directly in terms of the variables T, V, v, the natural vari- 
ables for n. (The trace here differs from that for the 
canonical ensemble in that here it involves summing over 
energy states with various numbers of particles.) In this 
case the entropy is S — tr p In p where p is the grand 
canonical density operator 10}. Using this we see that 



where 



n = U-TS-vM, 
^ T-vM 



E,-^£;j(iV)e-^[^^(~)-''^] 



(14) 
(15) 



(16) 



The change in notation, e.g. ?7 — > W, is made to dis- 
tinguish between quantities calculated in the canonical 
and grand canonical ensembles. In the thermodynamic 
limit, such a distinction is unnecessary, but, again, we 
have to consider finite size effects. From the mathemat- 
ics of thermodynamic theory, knowing f2 as a function 
of T, y, I/, we can construct all thermodynamic quanti- 
ties. However, the corresponding Helmholtz free energy 
J-", internal energy lA and entropy S will differ from their 
corresponding quantities -F, [/, S by virtue of the different 
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ensembles being used in the two cases. In particular, T 
is calculated via ((T3|l : 



(17) 



The continuous nature of M allows differentiation 
of (fT7|l with respect to TV; doing this at constant T and 
volume and recognizing the A/'-dependence of v gen- 
erated by yields the identity 



dN 



(18) 



as is well known (see, e.giiia*). This is of course recog- 
nized as the historical thermodynamic relation between 
the chemical potential and the Helmholtz free energy. 
But note that it is exact even for finite systems; the TL 
is not required for its validity. 

Now consider the definition of the chemical po- 
tential. It is clear that choosing or would be en- 
tirely arbitrary, so neither is satisfactory. The average 
of these would be a guess, as would be either of them; 
i.e., the only basis for such guesses (apparently) is that 
they reduce to the thermodynamic definition in the TL, 
and of course there is an infinity of such guesses. The 
idea that the c.p. can be chosen in various ways ^"■^■^ is 
unacceptable, for reasons already stated. Thus, in light 
of H18|l . and the role of the chemical potential in particle 
transport as argued by Tolman, plus the rather universal 
understanding of this role (see e.gi^iiSSiS), I suggest that 
the proper definition of the chemical potential is 



c.p. 



fj.. 



(19) 



(I have now bestowed on this the symbol p, used almost 
universally for the chemical potential.) Given this defi- 
nition, most of the other standard thermodynamic rela- 
tions follow, i.e. all that don't depend on the assumption 
of extensivity of the "extensive" variables, or in other 
words the assumption that U{S, A/") is a homogeneous 
first order function. E.g., it is well-known that, along 
with lfn|) and ((TH|l . the relations 



where 



dU 



P ■ 



-S 

TdS + pdU ~ pdV, 



dV 



follow directly from the definitions given. But in gen- 
eral one does not have the relations that follow from the 
extensivity hypothesis^: 

U ^ TS-pV + pM 

n -pV 

G = T + pV^pN. 



Well, so be it-the physical information contained in the 
density matrix relating to the average "extensive" quan- 
tities is given only by the thermodynamic egualities listed 
plus other relations derivable from these. The inequal- 
ities listed are written by HilU^ through the definitions 
p = G/N,p= -n/V; then 



Of course p p, p p in the TL. 

Clearly the only statistical approach that allows the 
rigorous connection to thermodynamics, (|18|l . is through 
the grand canonical ensemble (among the usually con- 
sidered ensembles). It seems reasonable to conclude that 
for finite systams the grand canonical ensemble is the only 
legitimate choice from the point of view of statistical ther- 
modynamics, whenever the chemical potential is relevant. 
The view proposed is simply that all of statistical ther- 
modynamics where the chemical potential is physically 
relevant be carried out on the basis of the grand canoni- 
cal ensembleiSLSSiSS This then obviates the apparent ne- 
cessity to define the c.p., ad hoc, as some functional of a 
free energy defined only on the integers. I will work with 
this new view and the definition H19|) henceforth.'^" 

The above argument deserves further scrutiny. Tol- 
man's proof about equality of p being the condition of 
zero particle fiow between two systems assumes that at 
thermal equilibrium the energy eigenvalues are the sums 
of the eigenvalues of the two systems. This is essen- 
tially the same assumption made in regard to the sim- 
ilar question regarding the relation between temperature 
and heat flow (see alsoSS) . This assumption of additivity 
was justified because the individual systems being con- 
sidered were macroscopic. However, the assumption can 
be questioned for finite systems, or where finite-size "cor- 
rections" are being investigated, as is our main interest 
here. In this connection, it has been shown that under the 
appropriate conditions, the neglect of the interaction be- 
tween a reservoir and a small system, is justified for the 
purpose of considering thermal equilibrium^ The ap- 
propriate conditions are physically sensible requirements 
on the reservoir (beyond being large), and small enough 
reservoir-system interactions. These lead at equilibrium 
to a well-defined temperature T of the small system, this 
T, along with the concept of heat flow, having typical 
attributes of temperature for large systems, in the sense 
of energy exchange between the small system and the 
reservoir. 

We expect (and assume) that a similar result will hold 
for the chemical potential and particle flow. In support 
of that expectation, we consider two examples based in 
physical reality, (a) the case of quantum dots, and (b) the 
prediction of ionicity in diatomic molecules (and some 
related solids). 

(a) Experiments on quantum dots indicate strongly 
that under appropriate conditions, one can ascribe with 
high certainty whether there is one or two, etc. electrons 

on the dot.35i36,37 Q^g 

might worry that description by 
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a grand canonical ensemble, which gives fluctuations in 
the number of particles, which might be large for small 
systems, would therefore contradict these experimental 
results. Further, while I believe the physics of these ex- 
periments is well understood, there is confusion about 
the concept of the chemical potential of the dot, at least 
in some writings. The following extremely simple exam- 
ple shows that the grand canonical ensemble can answer 
the question about the fluctuations, and illustrates how 
a proper understanding of the chemical potential gives 
a picture consistent with experiments. We consider the 
single-site Hubbard model, 

H = -hn + Un^n^, (20) 

where U{> 0) is the electron-electron Coulomb interac- 
tion, rio- is the occupation number for a spin-ci electron, 
and n — n^' + ni. /i > is the ionization potential when 
there is 1 electron. This was used in previous considera- 
tions of a pair of quantum dots (plus an inter-dot hopping 
^gj.j-i^-j i38j39j4o Recognizing that 2n|n| — n? ~ n, one sees 
that it is essentially the capacitive term also used to rep- 
resent the so-called "charging energy" in many single-dot 
models4ii^ There are only 4 states in the model, with 
just 3 values of n, namely, 0,1,2. 

The grand canonical distribution readily yields the 
(grand) partition function Z = 1 + 2e'^'^ -I- e'^'^'^~^^ , 
the average number of particles < n >, and the mean 
square fluctuation < ri^ > - < n >^= kT^^^, 

shown in FIG. □ for the typical valuea^S*^ U = lOmeV, 
T = J7/20 — 6K; Ii has been put to 0, amounting merely 
to a shift in the zero of ^ (from -Ii). It is seen that 
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FIG. 1: Average occupation number < n > and its mean 
square fluctuations vs. fi in meV. {U — 10 meV.) 

plateaus occur at integer values of < n > , and for fi in the 
corresponding range, the fluctuations in n are negligible. 
Thus small fluctuations have been achieved because the 
temperature is low compared to U, rather than for the 
usual reason, large system size. The plateaus in < n > 
illustrate the Coulomb blockade, commonly discussed 
in connection with transport experimentS(^^i^i*^2ii^, but 
ggg36j37_ (ifif — 0, < 71 > would rise directly to the value 
2, the plateau at < ti >= 1 not occurring.) This effect 
is already well-known in atoms, it being the result of the 
Coulomb repulsion, manifested in the fact that the ion- 
ization potential exceeds the electron affinity (a picture 



for the lithium atom quite similar to < n > in Fig. 
appears inii). The present discussion is strictly within 
thermodynamic equilibrium, the relation with transport 
measurements being that the conductance of the dot is 
small in the plateau regions, and large in the transition 
regions. 




FIG. 2: Schematic for capacitance spectroscopy on a quantum 
dot. Color online. 

The basic idea of the experiments can be understood in 
some detail on the basis of this simple model, provided we 
recognize the fundamental physical meaning of the chem- 
ical potential regarding electron flow, as discussed above. 
One of the types of experiments, called single-electron ca- 
pacitance spectroscopj*^, is essentially an equilibrium ex- 
periment. It can be idealized by a system consisting of a 
metal lead connected to the dot through a tunnelling bar- 
rier, as indicated schematically in FIG.|21 The proximity 
of the dot to the lower lead (the reservoir) is meant to 
indicate that the barrier between the dot and that lead is 
small, the tunnelling time r being short, the barrier to the 
upper lead being very large, so tunnelling through that is 
negligible The potential of the dot relative to the lead 
is controlled by a gate voltage V, which has a DC and a 
small AC component. The energy of electrons on the dot 
due to this potential is — eV^n, Vd being the potential on 
the dot, proportional to V. (We assume negligible poten- 
tial variation over the dot. Also, e = magnitude of the 
electron charge.) On adding this term to the Hamilto- 
nian H20|) . one readily sees that < rt > is a function of /i 
and eVd only through the sum: < n >= g{^ + eVd)- Con- 
sider first the case where there is only a DC component. 
For a fixed V the system will come to equilibrium, so that 
/i = eF, the Fermi energy of the lead. For small V, ep 
will deviate from its zero-F value by a term linear in 
V, so that < n >= ^(e^^ -t- cV), c = constant. Hence, by 
varying V slowly (rate << 1/t, the equilibration rate), 
one can scan through the abscissa of FIG. The AC 
component is introduced to help detect the signal; its fre- 
quency is small compared to 1/r, so that at each instant 
the system can still be considered as essentially being in 
thermodynamic equilibrium. Hence fi = ep even in this 
case. Through this, the experiment directly measures 
the charging of the dot, i.e. ^ (the capacitance of the 
dot, whence the terminology capacitance spectroscopy), 
c>c This derivative is seen from Fig. ^ to consist 

of sharp peaks at the points where an electron is added 
to the dot, as observed and interpreted in leim^^. Since 
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the derivative is proportional to the fluctuations in n, the 
peaks in these experiments are directly proportional to 
those seen in the lower curve, Fig.^ although in the ex- 
periment, T is considerably lower than assumed for the 
figure, so the observed peaks arc much sharper. (Also, 
the experiment considers the number of electrons going 
up to remarkably larger values than 2, because the dot 
there has many more bound states than the model taken 
here for simplicity.)^ 

In references^Siii one finds the chemical potential of 
the dot defined at T = as fidot{N) = ip{N) above) 
and fiN = 4>{^) (O above), respectively. These def- 
initions present a problem, seen as follows. For our 
model ^ with /i = 0, 0(1) ^ U. At U on the hor- 
izontal axis, < 71 > is either undefined (at T — 0), or 
3/4 (in the limit T ^ 0, as seen in the next section). 
But presumably fii = 0(1) is supposed to be the chem- 
ical potential for particle number = 1, and hence it is 
either undefined or self inconsistent. A similar argu- 
ment holds for the other definition. In fact, at T = 0, 
0(7Vo) EE ^A{No) = limT^o Ai(A/'), for No<Af <No + l, 
where < n >= J\f and A^o is an integer (see Fur- 
thermore, even in the zero-T limit, there is not just one 
possible value of the chemical potential, as apparently 
implied by each of these works; (|32|l shows that there 
are three possible values of fJ.{Af) for Af in the range 
Nq — 1 < M < Ao + 1, two for non-integer A^, one for 
integer A/'(= A^o) . (This is clear from Fig.[Tlfor A^o = 1 if 
one imagines the step- function limit of < n > as T ^ 0.) 

(b) We consider the example discussed iniSiii, namely 
the question of charge transfer in typical "ionic" diatomic 
molecules, but with a bit more emphasis on the role of the 
chemical potential. If our molecule is BC, then at very 
large separation R we know empirically that for minimum 
energy, atoms B and C are neutral (since either / is > ei- 
ther A for neutral atoms). As R decreases, one considers 
the possibility of charge transfer. The direction of this 
transfer, if it occurs, is determined of course by which 
direction gives minimum energy; but I want to see how 
this depends on the chemical potentials of the atoms. Let 
the numbers of electrons on the respective neutral atoms 
be A^b, Nc- The energies for the two directions of transfer 
are, for large R, 

E{B^C) = Ei,{Ni,-l) + E,{N, + l) + v{R), 
E{B^C) = Et,{Ni, + l) + E,{N,-l) + v{R), 

where the leading term of the interaction energy v{R) is 
— e^/i?, but can also contain higher polarization terms; 
Ei{N) is the energy of the isolated atom i. Thus, us- 
ing ®, 

EiB^C)-E{B^C) = 2{fic~f-iB), (21) 

showing that electron transfer from high to low chemical 
potential gives lower energy. This is of course consistent 
with our requirement on a proper definition of the c.p. 



III. T ^ LIMIT OF fi AND A^ 

As noted in the Introduction, Perdew et aliiSiii de- 
rived (O for finite systems, allowing for exact treatment 
of electron-electron interactions. Here we discuss the 
derivation, generalizing it, and in addition, consider, for a 
semiconductor, taking the limit T —> after the thermo- 
dynamic limit (TL). It is well known that for the non- 
interacting-electron model (NEM), the two limits com- 
mute for the semiconductor, i.e. when the number of 
electrons exactly fills the valence band (see e.g. Section 
IV) . That this however is not true in general is rather ob- 
vious; it will be discussed below. We will also calculate 
the zero-T limit of A/". 

The model we consider is that of a number of elec- 
trons A^ that move in a fixed (1-electron or external) 
potential, which can depend on N. As mentioned above, 
we are thinking of a molecule in a general sense (it can 
be a finite solid); the number of nuclei is fixed and their 
positions are fixed at positions that minimize the elec- 
tronic energy in the Born-Oppenheimer sense. However, 
we will also at times consider the case where the poten- 
tial is fixed, independent of N, a very common model 
for a solid; here we need to understand that in taking 
the thermodynamic limit, the usual procedure is to con- 
sider a sequence of such "molecules" , in each of which 
both the average number of electrons and the number of 
nuclei increase in proportion. 

It is of course known that there is a maximum num- 
ber N,riax of electrons that will bind to a molecule 
and clearly there is a minimum number Nmin such that 
the molecule (with a given number of nuclei) is bound. 
Defining the unbound states as describing a system other 
than the molecule, we consider only the bound states in 
the statistical mechanics, as done inS^ for atoms. Thus 

■ ,_ E,Ej^r^_A^ea:p[-/3n,-(Ar)] 

j:exp[-(3n,{N)] ' ^ > 

where 

nj{N) ^ Ej{N) ^ HN. (23) 

Clearly the T — > limit will be determined by the lowest 
"free-energy eigenvalues" , i.e. the lowest values of fij ( A^) . 
For fixed potential, Nmin — 0- We will exclude the end- 
points Nq — Nmin, N„iax- In thosc cascs /i ±oo.— 

An important assumption made is that the ground 
states satisfy 

Ei{N-l)-Ei{N) > Ei{N)-Ei{N+l), N = 1,2,3, • •■ . 

(24) 

This condition, which states that the ionization energy 
is > electron affinity for any A^, and which is a convexity 
condition for the function Ei defined on the integers, is 
usually true. It is expected intuitively, but has not been 
proven, even for a system of electrons moving in the field 
of nuclei that are fixed in position, as noted by Lieb^. 
The intuition on which it is based is that it costs more 
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energy to remove a bound electron from an A''-electron 
system {I{N)) than to remove it from an 1-electron 
system {A{N)), since the leaving electron "sees" a larger 
positive charge in the former case. However, if the nuclei 
are allowed to relax from their positions in the iV-electron 
system, both Ei{N — 1) and Ei{N + 1) will decrease 
from their fixed-nuclei values, weakening the inequality; 
thus, even if (|24|l were true for fixed nuclei, the inequal- 
ity would not follow in general. Empirically it is true 
for all atoms; it is not known to be violated for isolated 
molecules^. Also, it is easily seen to be true for all non- 
interacting fermion models with fixed external potential, 
as well as the zero-bandwidth limit of the repulsive-U 
Hubbard model (but not attractive U— ). But it is ac- 
tively being questioned in connection with experiments 
on quantum dots and similar systems. See e.g.^°. We 
will content ourselves with considering systems for which 
convexity holds. 

As shown in-'^^, the minimum of fli{N) will occur for a 
given value A^o of A^, given H24|) and /i = /io, where 

^o = ^[^i(A^o + l)--Bi(A^o-l)]. (25) 

Also, that analysis shows this minimum is unique, 
given H24() . It is easy to see that for this fi 



n,{No - 1) - n.iNo) 



Q,{No + I) - n.iNo) (26) 
I{No) - AjNo) ^2^^ 



where I{Nq) and A{Nq) are the ionization energy and 
electron affinity for the Ao-electron system. (|26ll says 
f2i(A'^) is symmetric about A^o for the nearest neighboring 
values; such symmetry does not occur for jA" — A^o| > 1 
in general. One easily sees that the minimum of ri(A) 
also will occur at and only at A^o for a range of /x's such 
thali^ 



-I{N„) <fi< -A{No), 



(28) 



/io lying in the middle of this range. However, at the 
end-points there is precisely 2-fold degeneracy: 

r!i(Ao-l) = r!i(A^o), for /i = -/(TVo) 

r!i(Ao + l) = ni{No), for/i = -^(Ao). (29) 

From this it is clear that at T — 0, 

= -I{No) ^No-l<M<No, 
H = -A{No) ^ A^o < A/" < Ao + 1. (30) 

This is the same as Eq. (6) of^^. From this and (|22|1 wc 
can see further that 



r ^r , - for m = -/(Ao) 

-'''^^^ N, + J^ for/. = -A(A.o), (^1) 



where j± = g{No)/g{No±l), g{N) being the degeneracy 
of the ground level Ei{N). More generally, irtf it was 



argued that at low enough temperature, and for finite Nq, 
the three terms in (|22|l coming from the ground states 
{j = 1) for A^ = A^o ^ 1, Ao, Ao + 1 dominate the sums 
for A^o - 1/2 < A/" < A^o + 1/2^ In fact, this condition is 
overly restrictive, Aq — 1 <A/'< Aq-I-I being sufficient for 
the 3-state approximation. This approximation givesii, 
with the larger range of validity indicated, 

r -/(Ao), A^o-l< J^<No 

lim M = < -^(/(Ao) + AiNa)), A/" = Aq 

[ -A(A^o), Afo<AA<Ao + l 

(32) 

Note that the extended range in the 1st and 3rd cases in- 
cludes (|3Uf) (excluded inii), important in the transition 
region between integers (see Fig. The consistency of 
the argument can be seen as follows. For non-integer 
TV, in the given range, says that fi must approach 
— /(Ao) or —A{No), yielding the degeneracy indicated 
in (|29|l . In this case, these 2 degenerate states will domi- 
nate the sums, the 3rd state will be a finite energy higher, 
and therefore will not contribute in the limit. In fact this 
"2-state" approximation yields the first and third entries 
of eq. (52) inii (including the T-dependent term, the 
third state giving only an exponentially small contribu- 
tion). One needs the three states to simultaneously ob- 
tain the result 1)32(1 for both integer and non-integer M. 
For M = Nq (integer), the non- uniqueness of /x at T = 
seen in (|28|) is removed at infinitesimal T, the value being 
uniquely that of © (or (0). The restriction to finite Ao 
is needed for the argument that only the ground levels of 
each A^ enter at low enough T. 

As regards degeneracy of states with fixed A^, Perdew's 
resull^ shows that this only affects the approach to the 
limit ((32|l . Incidentally, the linear-T approach to the 
limit found inii is also valid under the less restrictive 
conditions indicated in H32|l . It is interesting to note that 
while the degeneracy does not affect the zero-T limit of 
/i (at fixed TV), it does affect the limit of TV (fixed /z), 
as seen in (|31|l . For the Hubbard model of the previous 
section, 7± = 2, and Fig. shows that the limit is nearly 
achieved for the temperature used there. 

To give insight into some of the results just obtained, I 
consider their meaning for some simple models. For the 
ideal intrinsic semiconductor^'^ 



Ei{No 
Ei{No + l) 
Ei{No) 



l)-EiiNo) 
-Ei{No-l) 
-Ei{No-l) 



Cc + Ed 



(33) 
(34) 
(35) 



If we define the gap in the many-body spectrum as G — 
E2{Nq) — Ei{Nq) then we get the familiar result, G — 
tc — €v Remembering that the 1-elcctron energies are 
negative (the electrons are bound in the solid), we see 
that / = Ei{No — 1) — -Ei(TVo) — — e„ > as appropriate 
for the ionization energy, similarly for the affinity. These 
are special cases of the general theorem 



Ei{N + 1) < Ei{N), 



(36) 
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valid beyond NEM's. Clearly I-A = €c-ey=G 
within these NEM's. However, one should realize the 
well-known fact that for an interacting system this rela- 
tion between G and I — A does not hold in general. E.g. 
consider the half-filled, zero-hopping, extended Hubbard 
model (including nearest-neighbor intersite Coulomb in- 
teraction V > Q), with the number of sites (or electrons), 
iVo > 2. For this, I - A = U while G = U -V. I - A 
is commonly called the "band gap" for intrinsic semicon- 
ductors. 

The important result (|32|l . particularly for TV integer, 
applies to any finite "molecule" . It is natural to ask if 
that result holds in general for a solid if the thermody- 
namic limit is taken before T ^ 0. I want to show that 
while the two limits commute in some familiar cases, they 
do not always do so. I'll accomplish this by examples. 
Recollect first that the thermodynamic limit for a solid 
considers a sequence of solids where atoms are added to 
the crystalline array keeping the volume per atom fixed (I 
will follow the common approach using periodic bound- 
ary conditions). Generally then one can take the band 
edges as unchanged in this sequence, while additional 
energies fill the space between. Consider the NEM of a 
semiconductor. For the intrinsic case, and with the num- 
ber of electrons — precisely the number of states in the 
valence band, the usual result is the case where the TL is 
taken first, sums over wave vector having been converted 
to integrals, with the zero-T limit fi = (ty + ec)/2; the 
opposite order of limits is the same, since the zero-T limit 
is given by (|32|) which is the same, and the TL changes 
nothing. Similarly for the case of a semiconductor with 
donor and acceptor impurity densities n^j > > (as- 
suming the impurity bands have zero width). Again the 
usual result gives the zero-T limit after the TL as^ /i 
the donor level as T ^ 0; and the zero-T limit for the fi- 
nite system, given by H32|l , is again the donor level, which 
doesn't change under the TL. However, let's return to the 
ideal intrinsic case and add an electron. The example in 
the Appendix shows that for ijl{Nq + 1) the two limits do 
not commute. 

As a final comment in this section, I give a "physi- 
cal" explanation of why /i goes to the middle of the gap 
as T ^ for an ideal intrinsic semiconductor. This is 
based on the non-interacting model with the assumption 
that the rate of transfer of an electron of energy ti from 
a semiconductor to an adjacent metal with c.p. /imet is 
proportional to |ei — /imetl'T-e, is the density of elec- 
trons in the semiconductor, with a similar statement for 
transfer of a hole at energy £2 from the semiconductor 
to the metal, with rig replaced by rih. At very low T, 
there are some thermally excited electrons in the conduc- 
tion band and holes in the valence band, with rig — rih- 
Now bring a metal with c.p. ^met into conducting con- 
tact with the semi-conductor with c.p. — iisc- In ac- 
cordance with the understanding of the chemical poten- 
tial as a quantity whose gradient drives particle flow, the 
condition for no current is Hmet = P^sc- Suppose ^sc 
is in the gap but above the middle of the gap. Then 



|ec— A*met| — ^c — fJ-met < — fJ-met] — Mmet " Cii , ProduC- 

ing a net flow of holes into the metal, thus contradicting 
the condition of no flow when the c.p.'s are equal. The 
only way for the electron and hole flow to be equal, and 
hence to have no net flow, is to have Cc — Mmet — ^J■met — f■v^ 
from which one obtains the desired result. 



IV. FINITE SIZE EFFECTS ON fi. 

In light of Shegelski's finding of huge finite-size effects 
on (f) even for samples as large as 1 cm'^ , we feel compelled 
to see if similar effects might occur with our definition of 
the c.p. I followed standard procedure within the ef- 
fective mass approximation, calculating fj, via sums over 
wave vectors k, the thermodynamic limit being obtained 
when these sums are replaced by appropriate integrals. 
The sum was done numerically for 1 and 2 dimensions; 
I show the results for the latter case, the other being 
similar. 

The usual effective mass single-particle spectrum is 

Ek = -h^k^/inih = el for ek<0 

= ec + h^P/2me = el for > 0, (37) 

with k = 2tt / {La){pixi + p2X2),Pi = 0, ±1, • • ■ , ±(L - 
l)/2. La is the linear size of the square lattice, L being 
taken as odd. Using the grand canonical ensemble, the 
following is solved for fj,: 

Y.[f{el-fi) + fiel-fi)]^No, (38) 

k 

where A^o = L'^- I took (2TT'^h^)/{mea'^) lOcV, which 
is roughly its value using the free electron mass, a = 3A 
and a gap of 0.5eV ; also I assumed mh/nie = 2. The 
results are given in Tableland in Figure |31 where just 



L 9 


17 


35 


81 


T 








,01 .25004 


.25244 


.25347 


.25348 


,005 .250005 


.250294 


.25171 


.25173 


,0025 .2500005 


.250005 


.25065 


.25087 


,00125 




.2500 


.25043 


,001 




.25003 


.25034 


TABLE I: fi 


vs T, L; 


/i,r in 


eV. 



the two largest sizes are plotted (I used the remarkable 
Arbitrary Precision command of MathematicaSS) . For 
the largest L, the results agree with the T. L. in 2-D 

fioo ^kc + kTln^), (39) 

2 TOe 

essentially perfectly down to T = .001, i.e. T ^ 
lOK. (Actually there are exponentially small corrections 
to H39|l . which are negligible here.) 
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0.254 



.253 



a. 0.252 



0.251 



L=81 



L = 35 



0.002 0.004 0.006 0.008 0.01 



FIG. 3: Chemical potential /i as function of T and L. jj,, T in 
eV. 



That this is for a system size of less than 7000 unit cells 
shows that our definition of the c.p. essentially reaches 
the TL for systems far smaller than "macroscopic" . 

It is seen that finite size effects lead to a reduction 
in fi/ Hoo at low T. Further, the points for L = 35 ap- 
pear to be heading to at finite T (they don't actually!). 
These facts can be understood as follows. Basically the 
reduction occurs when T becomes less than the spacing 
between levels near the band edges. The seeming phase 
transition at T ~ .001 in the L = 35 case arises because 
of the essential singularity at T = of ea;p(— Tq/T). To 
see this, we plot e~^/* in Figure 0] which is seen to show 
a kind of critical value tc such that the function appears 
to be zero for t < tc, with tc ~ 0.1. The level spacing for 




0.05 0.1 0.15 0.2 0. 25 



FIG. 4: e"^/* vs t 



the conduction band is 10/L^ .008 for this case, which 
would give Tc ~ .0008, close to the observed value .001 . 



V. FINAL REMARKS 

We have argued that the proper ensemble for consider- 
ing the statistical thermodynamics of finite size systems 
is the grand canonical ensemble whenever the chemical 
potential is physically relevant, and that the uniquely 
correct definition ^ of the chemical potential is the fj, 
that is usually written in that ensemble's density oper- 
ator {v in Q). This definition satisfies the historical 
thermodynamic definition (|18|l . the differentiation being 
defined because the average value of the number of par- 
ticles Af in the grand canonical ensemble is continuous. 
This obviates the need for guesses such as 0, ip where one 
tries to define the c.p. as some functional of a free energy 
function defined only on the integers, a need that occurs 
for the other standard ensembles. We note that this is 
a rather serious departure from the usual view of sta- 
tistical thermodynamics, where a fundamental equation 
obtained from any ensemble is thought to give all fun- 
damental equations. We discussed small-system experi- 
ments that support the proposed definition, and showed 
that this definition avoided the "bad" huge finite-size ef- 
fects found by Shegelskii for the ideal intrinsic semicon- 
ductor using (f) as the definition of the chemical potential. 

It should be realized that since I have defined finite 
systems as systems with anywhere from a few to ~ 10^"^ 
particles, for most 'finite' systems on the large end of this 
range the difference between the various ensembles will 
be negligible, the free energy will be extensive and the 
various quantities fj,, (f>, will be essentially the same. It 
is for the smaller systems, and what may be considered 
the anomalous ideal semiconductor model, that my con- 
clusion is relevant. For a small system S in contact with 
a reservoir R, one can of course consider the whole sys- 
tem, S + R as SL new system. If R is macroscopic and 
not anomalous, the various ensembles and definitions of 
the c.p. will be equivalent for S+R. But this procedure 
entails consideration of the reservoir; the advantage of 
using a statistical description of S alone is that one can 
avoid consideration of reservoir details. This point was 
illustrated in the small-system examples presented above 
(the lead in the quantum dot example of course being the 
reservoir). 

Despite the fact that the definition (j> of the c.p., used 
by many, is unacceptable, it is a well-defined quantity. 
Shegelski's finding^ that it shows huge finite size effects, 
should give one pause in making the usual rather univer- 
sal assumption that the TL of any quantity observable in 
principle is well achieved for usual macroscopic systems. 
In particular, the argument that the definitions (f), V are 
justified for macroscopic (and ideal intrinsic) semicon- 
ductors is proved false by this finding. Because I found 
this phenomenon quite surprising, I give in the Appendix 
a relatively simple discussion that I believe makes it plau- 
sible. 

Interestingly, the result found there shows that, al- 
though using the correct (i.e. the proposed) definition 
of the c.p. avoided the bad behavior of cj), similar "bad" 
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behavior is shown for this correct definition in the case 
of an added electron. It is important to note that this 
case is quite special, the Fermi level ^ sitting at a sin- 
gularity in the density of states in the limit of zero tem- 
perature. Whether or not surprising behavior will show 
up at other types of singularities, most macroscopic sys- 
tems will show a behavior of fi consistent with the usual 
expectation that it is very close to the thermodynamic 
limit even at very low temperatures. E.g., in a metal, 

very low means > separation of 1-electron energies (typ- 
ically > {10^ /N)K , to within a few factors of 10 either 
way), tiny indeed, even in 2 dimensions where the num- 
ber of atoms N is around 10^^. In this case the distinc- 
tion between and fi essentially vanishes. Similarly, 
the presence of realistic amounts of impurities in a semi- 
conductor tends to remove the distinction between these 
quantities, as noted by Shegelski^-^ (surface states can act 
similarly). But for small systems the distinction is un- 
avoidable, leading me to consider the very-small-system 
examples discussed above as the physically most impor- 
tant ones, although the ideal intrinsic semiconductor is 
of considerable pedagogical importance (it's where one 
begins any study of semiconductors). 
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APPENDIX A: CHEMICAL POTENTIAL FOR 
AN ELECTRON ADDED TO AN INTRINSIC 
SEMICONDUCTOR 

Working in the grand canonical ensemble, to calculate 
the term J-{Nq + 1) that appears in 4>{Nq), one requires 
the chemical potential fi{No + 1). {No is the number of 
electrons in the neutral semiconductor.) fJ.{No + 1) is 
simpler to calculate than T{Nq -f 1), and fortunately it 
shows huge finite size effects like those found-l for (f>{No). 
I expect that this behavior of Iji{Nq -|- 1) is the origin of 
the behavior of (j){No), and therefore present its calcula- 
tion here with the purpose of making Shegelski's result 
plausible to the skeptics (of which I was one). According 
to Shegelski, the source of the slow convergence to the 
TL is a logarithmic dependence on the volume or Nq, so 



we will be looking for this. 

I consider temperatures >> the separation of levels 
near the bottom of the conduction band, in which case 
we can replace sums over k by integrals. Writing 

mf^) = / f{y-f^)dy+-^ / /(y-M)rfy, (Ai) 

^ Jo ^ JA+G 

the equation determining fJ.{N() + 1) is then 
M{ti{No + 1)) = A^o + 1. 



(A2) 



In I|A1|I . A is the width of the valence band, G is the 
gap, and Nq/A = D,No/A' = D' are the valence and 
conduction band densities of states (I'm considering di- 
mensionality 2). The integrals are elementary, giving 



J\f{fi) = No + DkTl 



1 + e-f^f 



1 + g-/3(M-A) 

+D'kTln[l + e-^'-'^+^-'''^]. (A3) 

We're interested in kT << G < A. As a check, if we 
equate this expression for M{fJ.) to Nq and assume > A 
we find (Eg for ^i{No). yields 

(A4) 

where ii{No + 1) = fi'. For simplicity, take A = A'. 
Then 

can be written as a quadratic equation in 
e^'^ , which gives finally 



fi' = A + G + kTln 
where x = e^^^/^o. 



X-1 + V(2^-lF+4arexpP/3G) 

2 ' 
(A5) 



fj(No + l) -ee 




FIG. 5: Deviation of the chemical potential from the bottom 
of the conduction band, ^{Nq + 1) — Ec, in units of the gap, 
G=0.25eV, as a function of T for iVo = 10** and 10^^ T in 
eV. 



EH 



gives 



^im ^l{No + 1) = A G A/7Vo, (A6) 
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and 

lim ^(iVo + 1) = A + G/2. (A7) 

A^o^c>o 

The zero-T limit is close to the bottom of the conduction 
band, as expected. This hmit is questionable since we 
assumed T » the separation of conduction band levels, 
but see below for justification. The TL (|A7|) is where it 
must be, in the middle of the gap, because the difference 
in the numbers of electrons and holes is just 1 (in Nq). 

To look more closely at the approach to the TL, we 
note that x — \~ /3A/No. Hence /i is analytic at 1/A^o ~ 
0, according to (|A5(I . I.e., there is no logarithmic Nq 
dependence seen here. However, we note that the two 
terms under the square root compete at large A'o and 
small T: At small enough T, fixed Nq, the term (x — 1)^ 
dominates, while at large enough A'o the other term wins. 
Let Tq be the temperature at which the two terms are 
equal. Putting 

z = PoG/2 (A8) 
p - iVoG/(2A), (A9) 

To is determined by e^/^ — = 1 or 

z In z = In p, (AlO) 

for p >> 1 and z >> 1. This says that z k, In p a,s 
p oo, and, further;^ 

z ^ In p - In In p + 0{ ^^^^^ ^ ). (All) 

In p 



Thus, not only does kTo/G{^ 1/z) approach as 1/lnp, 
but the corrections to this asymptotic behavior become 
small very slowly. Thus we have seen just how logarith- 
mic behavior enters. 



For a quantitative illustration, we have plotted (|A5() 
in Fig. El taking G = 0.25ey, A = lOeV. It is seen that 
the TL, namely fJ,{NQ + 1) being at the middle of the 
gap, is approached for high T, and moves to the top of 
the gap at T 0. It can be checked that the temper- 
ature To, below which deviation from the TL begins to 
get big, tracks well with the first two terms of ljAll|l . 
For the larger value Nq — 10^^, which is macroscopic 
(D=2), To - 0.005ey ~ 50K, so the TL is largely in er- 
ror for T ^ 25Km^ This is very similar behavior to that 
found by Shegelski-*^ for (/)(iVo), but 1 emphasize that the 
quantity (j-^Nq + 1) studied here is the true chemical po- 
tential for the Nq + 1-particle system, which is not (l){No). 
However the two quantities arc related in that fJ-{No + 1) 
enters directly the calculation of J-{No -1-1), which en- 
ters directly into the definition of 4>{No), making it very 
plausible that the large finite size effects in (j) (for macro- 
scopic samples!) found by Shegelski are real. One should 
remember that this 4> is not the chemical potential for 
the neutral semiconductor, as I have argued above. Nev- 
ertheless, it is well defined, and its large deviation from 
the TL^ renders incorrect the widely quotedii2iSii2*i2ii^ 
0, and similarly^^'^^ Tp (I presume), as representing the 
chemical potential for this neutral system. 



1 M. R. A. Shegelski, Solid State Commun. 38, 351 (1986). 

2 M. R.A. Shegelski, Am. J. of Phys. 72, 676 (2004). 

^ But later in his papers these quantities may be calculated 
either in the canonical or in the grand canonical ensemble. 

* C. Kittel, Introduction to Solid State Physics, John Wiley 
and Sons, New York (1971). 

^ N. W. Ashcroft and N. D. Mermin, Solid State Physics, 
Saunders College, Philadelphia (1976). 

® This was called an ideal intrinsic semiconductor^; it is the 
usual textbook model where noninteracting electrons move 
in a periodic potential, treated by periodic boundary con- 
ditions (there are neither impurities nor surface states). 
J. L. Lebowitz and E. H. Lieb, Phys. Rev. Lett 22, 631 
(1969). 

* The condition T > must be understood: the TL of 
can have a discontinuity at T = 0, as in the semiconductor 
model just discussed. I.e., (j> (and V of Q can depend on 
the order in which the TL and the limit T — » are taken. 

^ H. B. Callen, Thermodynamics, John Wiley and Sons, New 
York (1960). 

P. T. Landsberg and D. C.Browne, Solid State Commun. 
62, 207 (1987). 

^ C. Kittel and H. Kroemer, Thermal Physics, W. H. Free- 
man and Co., New York (1980). 
^ Fetter, A. L. and Walecka, J. D. (1971) Quantum Theory of 



Many-Particle Systems, McCraw-Hill, New York-London. 
T. L. Hill, Thermodynamics of Small Systems, W. A. Ben- 
jamin, Inc., New York (1964). Chap. 15. 
P. M. Chaikin and T. C. Lubensky, Principles of condensed 
matter physics, Cambridge University Press, Cambridge, 
UK (1995). 

^5 R. Baierlein, Am. J. Phys. 69, 423 (2001). 

J. P. Perdew, R. G. Parr, M. Levy, J. L. Balduz, Jr., Phys. 
Rev. Lett. 23, 1691 (1982). Regarding Eq. (10) of this pa- 
per, one should understand that the cases N = Z ±1 are 
outside the stated allowed range (and are therefore incor- 
rect for general Z). The result for N = Z is correct, and 
corresponds to 0, Q, and the = No case of l)32|l . 
present paper. 

^"^ J. P. Perdew, in Density-functional methods in Physics, 
edited by R. M. Dreizler and J. da Providencia, Plenum 
Press, New York, 1985. p. 265. 

Non-integral average values of the number of particles were 
also considered iniSi— . 
T. A. Kaplan, Phys. Rev. A 7, 812 (1973). 
Erwin Schrodinger, Statistical Thermodynamics, Cam- 
bridge University Press (1952). 

R. Parr, R. A. Donnelly, M. Levy and W. E. Palke, J. 
Chem. Phys. 68, 3801 (1978). (1990) 
22 R. S. MuUiken, J. of Chem. Phys. 2, 782 (1934); ibid. 3, 



12 



573 (1935). 

E. P. Gyftopoulos and G. N. Hatsopoulos, Proc. Nat. Acad. 
Sci. USA, 60, 786 (1968). This works states that © follows 
from but doesn't mention a convexity requirement (the 
result does not follow if (1241 is replaced by a concavity 
condition) . 

L. Pauling, The nature of the chemical bond, Cornell Uni- 
versity Press, New York, (1960). 

R. C. Tolman, The Principles of Statistical Mechanics, Ox- 
ford University Press, (1938), §140 (b). 

^® L. D. Landau and E. M. Lifschitz, Statistical Physics, 
Chap. XV, Pergamon Press, New York (1969). 

■^^ Magnetic properties of systems of localized spins are ex- 
amples where the chemical potential may not be physically 
relevant. 

■^^ Shegelski- has argued that the canonical ensemble is most 
appropriate for a semiconductor, since it has a fixed num- 
ber of electrons. I fault this argument by noting that, phys- 
ically, fluctuations in the number of electrons will occur in 
semiconductors as well as in metals. This is particularly 
true for the situation where the semiconductor is put in 
contact with a piece of metal in order to consider electrical 
current. Furthermore, these (relative) fluctuations for the 
grand canonical ensemble within the noninteracting effec- 
tive mass model can be seen to be negligible for a macro- 
scopic semiconductor. 

^® For discussion of the use of the canonical and grand canon- 
ical ensembles in an example even where the average num- 
ber of particles is fixed, see D. Cabib and T. A. Kaplan, 
Phys. Rev. 7 2199 (1973). Erratum: the -|- in the line just 
below eq. (4) should be -. 

^° The formal generalization of 1191 and 1181 to a multi- 
component system is straightforward. One has TV,; particles 
of type i, fj.N in p3|l is replaced by ^ ^liNi, H18|l is replaced 
by dT jdNi = [li. Mi is the expectation value of Ni, and 
the derivative is taken holding the other A/} constant. 
S. R. De Groot, Thermodynamics of irreversible processes, 
North-Holland Publishing Co., Amsterdam (1951). 

F. J. Blatt, Physics of electronic conduction in solids, 
McGraw-HiU Book Co., New York (1968). 

D. K. Ferry, and R. O. Grondin, Physics of submicron de- 
vices. Plenum Press, New York and London (1991). 
C. Cohen- Tannoudji, J. Dupont-Roc, and G. Grynberg, 
Atom-Photon Interactions, John Wiley and Sons., New 
York, Chapter IV (1992). 

S. Tarucha, in Mesoscopic Physics and Electronics, editors 



T. Ando, Y. Arakawa, K. Furuya, S. Komiyama, and H. 
Nakashina, Springer- Verlag, Berlin, Chapter 2.4. (1998). 
R. C. Ashoori, Nature 379, 413 (1996). 
R. C. Ashoori, H.L. Stormer, J. S. Weiner, L. N. Pfeiffer, 
S.J. Pearton, K.W. Baldwin, and K. W. West, Phys. Rev. 
Lett. 68, 3088 (1992). 

X. Hu and S. Das Sarma, Phys. Rev. A 61, 062301 (2000). 
'"^^ J. Schliemann, D. Loss, and A. H. MacDonald, Phys. Rev. 

B 63, 085311 (2001). 
""^ T. A. Kaplan and C. Piermarocchi, Phys. Rev. B 70, 

16131 1(R) (2004). 

M. A. Kastner, Physics Today 46, 24 (1993); Rev. Mod. 
Phys. 64, 849 (1992). 
''^ L. P. Kouwenhoven, D. G. Austing, S. Tarucha, Rep. Prog. 
Phys. 64, 701 (2001). 
S. Tessmer, priv. comm. 

There is another source of peak width in addition to the 
thermal effect considered here, namely that due to the hop- 
ping of electrons between the levels of the dot and the over- 
lapping continuum of the leads. Under certain conditions 
the thermal broadening can be the dominant effect. See—. 
""^ O. Steffens, U. Rossler, M. Suhrke, Europhys. Lett. 45, 
529 (1998). 

*® For a more realistic but still simple description of a quan- 
tum dot, see Kastner's Physics Today article^^. 
E. H. Lieb, Phys. Rev. A 29, 3018 (1984). 
E. H. Lieb, in Density- functional methods in Physics, 
edited by R. M. Dreizler and J. da Providencia, Plenum 
Press, New York, 1985. p. 11. 
J. F. Harrison, private communication. 

^° M. E. Raikh, L. L Glazman, L. E. Zhukov, Phys. Rev. Lett. 
77, 1354 (1996). 

This is equivalent to the theorem of referencai- for NEM's. 
This disagrees with the condition given ir/4 for the domi- 
nance of these terms. 

G. Zhislin, Trudy Moskov Mat. Obsc. 9, 81 (1960); T. A. 
Kaplan and W. H. Kleiner, Phys. Rev. 156,1 (1967). 
N. B. Hannay, in Semiconductors, edited by N. B. Hannay, 
Reinhold Publishing Corp., New York (1959), p.31. 
S. Wolfram (1991, reprinted in 1993 with corrections). 
Mathematica, Addison- Wesley, New York. 
N. G. De Bruijn, Asymptotic Methods in Analysis, section 
2.4., Dover Publications, New York (1981). 
These temperatures are high enough to justify the replace- 
ment of the sums by integrals. 



